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Congenital amusia is a musical disorder that mainly affects pitch perception. Among 
Mandarin speakers, some amusics also have difficulties in processing lexical tones (tone 
agnosics). To examine to what extent these perceptual deficits may be related to pitch 
production impairments in music and Mandarin speech, eight amusics, eight tone agnosics, 
and 12 age- and IQ-matched normal native Mandarin speakers were asked to imitate music 
note sequences and Mandarin words of comparable lengths. The results indicated that both 
the amusics and tone agnosics underperformed the controls on musical pitch production. 
However, tone agnosics performed no worse than the amusics, suggesting that lexical tone 
perception deficits may not aggravate musical pitch production difficulties. Moreover, these 
three groups were all able to imitate lexical tones with perfect intelligibility. Taken together, 
the current study shows that perceptual musical pitch and lexical tone deficits might coexist 
with musical pitch production difficulties. But at the same time these perceptual pitch 
deficits might not affect lexical tone production or the intelligibility of the speech words 
that were produced. The perception-production relationship for pitch among individuals 
with perceptual pitch deficits may be, therefore, domain-dependent. 
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INTRODUCTION 

Successful communication relies on the seamless integration of 
auditory perception and vocal production. It is widely acknowl- 
edged that auditory perception strongly affects vocal production. 
To a certain extent, impaired auditory perception hinders vocal 
production (Peng et al, 2004; Han et al, 2007; Xu et al, 2011 ). This 
is true for both music and language. As the two most important 
communication vehicles, music and language share a vital element, 
namely pitch. Exploring the impact of impaired pitch percep- 
tion upon pitch production across music and language domains is 
thus the key to understanding the influence of impaired auditory 
perception on vocal production. 

In the last decade, a developmental perceptual pitch deficit 
known as congenital amusia (Peretz, 2001; Peretz etal., 2002) 
has increasingly attracted research attention. This characteristic 
acoustical pitch deficit (Hyde and Peretz, 2004; Pfeuty and Peretz, 
2010) was initially related to deficient musical pitch perception 
(e.g., Foxton etal., 2004), which occurs independently of neuro- 
logical trauma, mental retardation, autism, deafness, or lack of 
musical exposure. Subsequent research has suggested that its ori- 
gin not only is related to the impaired fine-grained pitch processing 
(Hyde and Peretz, 2003; Foxton et al, 2004), but also involves com- 
promised pitch working memory (Gosselin et al., 2009; Tillmann 
et al, 2009; Williamson and Stewart, 2010; Williamson et al., 2010; 
Albouy et al., 2013), timbre perception deficits (Marin et al., 2012), 
and emotional prosody perception difficulties (Thompson etal, 
2012). 

Recent studies have evidenced amusia's related pitch deficits 
in the speech domain, where pitch also plays an important role. 



Individuals with amusia (hereafter, "amusics") may demonstrate 
lexical tone deficits among speakers of tonal (Nan etal, 2010; 
Liu etal., 2012) and non-tonal languages (Tillmann etal., 2011). 
Likewise, amusics have been shown to suffer from parallel speech 
intonation problems, a finding that has held true among speakers 
of both tonal (Jiang etal, 2010) and non-tonal languages (Liu 
etal., 2010). One of our earlier studies showed that a minority 
subgroup of amusic Mandarin speakers also had difficulty with 
lexical tone discrimination and identification in Mandarin speech 
(hereafter "tone agnosics") (Nan etal., 2010). 

This cross-domain perceptual pitch deficit offers an ideal 
opportunity to understand the influence of auditory perception 
on vocal production in music and speech. The current study sets 
out to investigate to what extent these perceptual pitch deficits 
may be related to pitch production impairments in music and 
Mandarin speech among Mandarin speakers. Mandarin Chinese 
is a tone language which relies on pitch variations to alter the 
meaning of words. Although amusic individuals who speak non- 
tonal languages also demonstrate similar problems with lexical 
tones (Tillmann etal, 2011), to study amusics and tone agnosics 
among tone language speakers wQl present unique perspectives 
on the impact of impaired pitch perception on pitch production. 
This is because in a tone language environment, the perception 
and production of lexical tones are basic communication needs of 
daily necessity. 

In research on speakers of non-tonal languages, perceptual 
pitch deficits have generally been associated with poor pitch pro- 
duction in music (Dalla Bella etal., 2009; Hutchins etal., 2010). 
According to the vocal sensorimotor loop model (VSL model. 
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Berkowska and Dalla Bella, 2009), perception is a necessary but 
insufficient element of vocal production. Motor components, such 
as motor planning and auditory-motor mapping, also play vital 
roles (Berkowska and Dalla Bella, 2009). In accordance with the 
assumptions of the VSL model, the existence of normal pitch 
perception may not necessarily preclude poor pitch singing, due 
to the possibility of independently deficient motor-related func- 
tions (Dalla Bella etal., 2007). The inclusion of both overt and 
covert perceptual components in the VSL model offers greater 
explanatory power. Within the amusic population, cases have 
been noted in which pitch perception impairments do not nec- 
essarily cause vocal production deficits (Loui etal, 2008; Dalla 
Bella et al, 2009). This could be explained by preserved covert but 
impaired overt pitch perceptual abilities; such an instance would 
corroborate the findings of previous mismatch negativity (MMN) 
studies on near-normal neural processing of fine-grained pitch dif- 
ferences without awareness in congenital amusia (Moreau etal, 
2009; Peretz etal., 2009). More specifically, the auditory cortex 
may function relatively normally in these amusics, but its con- 
nectivity with the pars orbitalis of the right inferior frontal gyrus 
may function aberrantly (Hyde et al., 2011). For the other amusics, 
however, it is very likely that their abnormalities reside not only 
in the fronto-temporal pathway but also in the auditory cortices, 
as shown by a more recent study using magnetoencephalography 
and voxel-based morphometry (Albouy et al., 2013). 

With regard to speech, however, linguistic tone deficits are 
not necessarily always correlated with production impairments, 
as suggested by a recent study showing that amusic speakers of 
non-tonal languages are unable to discriminate between speech 
intonations despite production being intact (Hutchins and Peretz, 
2012). A more recent study, in contrast, reported impaired speech 
and song imitation among Mandarin speaking amusics (Liu 
et al., 2013). This sometimes asymmetrical perception-production 
relationship between music and speech domains could not be 
explained by the apparent acoustical difference between musi- 
cal pitch and linguistic intonation - i.e., fine-grained for musical 
pitch but coarse-grained for linguistic intonation. As shown in one 
recent study (Dalla Bella et al, 201 1), after controlling for acous- 
tic pitch differences across domains, a young university student 
with intact musical pitch perception but impaired musical pitch 
imitation was shown to have intact linguistic tone production. 

Among tonal language speakers, similar perceptual lexical tone 
deficits have been observed among individuals whose lexical tone 
production is intact (Nan et al., 2010). However, pitch production 
in music (singing abilities) among Mandarin speakers, especially 
among amusic and tone agnosic Mandarin speakers, has not yet 
been fully examined (but see Liu etal., 2013 for impaired pitch 
production in both music and speech for Mandarin speaking amu- 
sics only). It should be noted that, so far, the exact nature of 
tone agnosia is not yet clear. As shown in our early work (Nan 
etal., 2010), the tone agnosics had little problem identifying lex- 
ical tones carried by the same segments (e.g., word onsets and 
rhymes), but they had difficulties in tones embedded in differ- 
ent segments. This might be due to low executive or attentional 
control in these individuals. In the current study, we controlled 
these factors by matching the control, the amusic, and the tone 
agnosic groups on measures of executive functions and working 



memory. The tone agnostics were thus also amusics but with 
additional perceptual lexical tone disorders, as the tone agnosics 
and the amusics demonstrated similar levels of music percep- 
tion deficits [as indicated by the similar melodic Montreal Battery 
of Evaluation of Amusia (MBEA) tests scores between these two 
groups]. 

In the current study, we tested musical pitch and lexical 
tone production among age- and IQ-matched amusics and tone 
agnosics (i.e., amusics who were, at the same time, tone agnosics) 
relative to normal controls, with three specific research aims. The 
first was to understand how musical pitch perception deficit is 
related to musical pitch production among Mandarin speakers. 
Second, we wanted to explore how the lexical tone impairments 
observed in tone agnosics would be related to musical pitch pro- 
duction relative to the other amusics who had intact lexical tone 
perception. Based on previous results of the dissociation between 
musical perceptual and productive abilities in amusia, we specu- 
lated that musical pitch production difficulties might be present 
in some but not all amusic participants. It is possible that the same 
holds true in tone agnosics, since they were also amusics. Alter- 
natively, tone agnostics might show more severe musical pitch 
production deficits compared to the amusics, if the lexical tone 
deficit were detrimental to musical pitch production. The third 
research aim was to test lexical tone production using a novel set 
of objective analyses. It is possible that the subjective rating system 
used in our earlier study (Nan et al, 2010) has been insufficient for 
detecting subtle lexical tone production deficits, especially among 
tone agnosics. 

MATERIALS AND METHODS 
PARTICIPANTS 

Sixteen amusics (six females) and twelve matched controls (five 
females) participated in the study. Among these 16 amusics, eight 
were also impaired in lexical tone perception and were thus identi- 
fied as tone agnosics. A summary of all participants' characteristics 
is provided in Table 1. All participants were university students 
in Beijing and native Mandarin speakers without formal musical 
training. They reported no vocal, neurological, or audiological 
deficits. Their binaural audiometric thresholds were at or below 
20 dB hearing level for octaves from 250 to 8000 Hz. Additionally, 
the controls reported no difficulty singing. Among all participants, 
23 were right-handed and five were left-handed, as assessed by 
the Edinburgh Handedness Inventory (Oldfield, 1971). Informed 
written consent was obtained from all participants. This research 
was approved by the Institutional Review Board of Beijing Normal 
University. 

All participants were assessed with the six tests of the MBEA 
(Peretz et al., 2003) and the lexical tone perception tests employed 
in our previous study (Nan et al., 2010). The MBEA includes three 
melodic pitch-based tests (scale, contour and interval), two time- 
based tests (rhythm and meter), and one memory test. All amusic 
participants scored below the cut-off score of 7 1 .7%, which corre- 
sponds to two SDs below the mean of the normal controls that was 
obtained in our earlier study (Nan etal., 2010). The lexical tone 
perception test contains identification and discrimination tasks. 
Among the 16 amusics, eight tone agnosics were identified based 
on the same criteria (i.e., performance below the cut-off scores of 



Frontiers in Psychology | Cognitive Science 



January 2014 | Volume 4 | Article 1024 | 2 



Yang etal. 



Pitch production amoung Mandarin amusics 



Table 1 |The characteristics of the controls, the amusics, and the tone agnosics with percentages of correct responses on the MBEA and lexical 
tone perception tests. 



Control (n = 12) 



Amusia (n = 8) 



Agnosia (n = 8) 



Mean age (range) 
Male/female 
Right/left handedness 
Performance IQ (SD) 
Verbal IQ (SD) 
Executive function (SD) 
Working memory (SD) 
MBEA mean (SD) 

Scale 

Contour 

Interval 

Rhythm 

Meter 

Memory 

Global 
Lexical tone mean (SD) 

Mean 

Discrimination (different segments) 



22.5 (19-26) 
7/5 

10/2 

115.8 (8.1) 
128.8 (6.0) 

13.6 (0.9) 

15.1 (1.9) 

91.7 (6.9) 
90.0 (8.4) 

85.8 (11.1) 
92.8 (6.2) 
81.4 (16.6) 

93.4 (4.0) 

89.2 (6.2) 

96.5 (4.1) 

94.3 (4.7) 



21.8 (20-25) 

5/3 

6/2 

111.3 (6.5) 
126.9 (5.5) 

13.3 (0.6) 
14.1 (1.2) 

60.4 (13.3) 

63.3 (6.7) 
60.8 (8.5) 

65.0 (13.1) 

60.4 (21.3) 
74.6 (9.6) 

64.1 (3.0) 

96.1 (2.8) 
92.6 (73) 



24.5 (19-28) 
5/3 

5/3 

110.3 (6.0) 
124.9 (8.0) 
12.8 (1.1) 

13.6 (2.2) 

574 (8.6) 
55.8 (11.1) 

60.4 (79) 
64.3 (11.9) 

65.5 (10.4) 
676 (76) 
61.8 (5.4) 

62.8 (12.6) 
65.0 (8.6) 



79.2% for the lexical tone discrimination test with different seg- 
ments - viz., word onsets and rhymes - and 80% for the average 
lexical tone perception tests, both of which correspond to three 
SDs below the means of the normal controls) as employed in our 
previous study (Nan et al., 2010). Except with respect to the lexical 
tone tests, the tone agnosics performed equivalently to the amusics 
on the MBEA tests (allps > 0.1). 

The amusic and tone agnosic groups were matched for age, 
handedness, performance IQ, and verbal IQ based on Wechsler 
Adult Intelligence Scale-Revised by China ( WAIS-RC; Gong, 1 992 ) 
with the control group (all ps > 0.1). Additionally, these two 
groups were also matched on measures of executive functions 
and working memory as derived from WAIS-RC with the control 
group (bothps > 0.1). Specifically, executive/attentional functions 
were indexed by the block design and similarities tests. The block 
design test taps attentional aspects of executive function (Chase 
etal., 1984; Kazui etal., 2011), whereas the similarities test may 
reflect abstraction and reasoning (Stern and Prohaska, 1996). The 
working memory index included arithmetic and digit span tests 
(Wechsler, 1981). 

MATERIALS AND PROCEDURES 
Music production test 

All of the musical stimuli were computer-synthesized with a piano- 
like timbre. There were two imitation conditions: one-note and 
three-note. One-note condition included 13 trials, each being one 
of the 13 notes (G3, A3, B3, C4, D4, E4, F4, G4, A4, B4, C5, D5, and 
E5). Each note lasted 500 ms. Three-note condition consisted of 
eight trials, with two trials for each of the four different directions 



("up", "down", "down up", and "up down") (Figure 1). Except for 
G3, A3, and D5, the remaining notes in one-note condition were 
also used in three-note condition. A trial in three-note condition 
lasted 2500 ms, including three 500 ms notes and two 500 ms gaps 
in between. 

Lexical tone production test 

A female voice actor who was a native Mandarin speaker pro- 
duced a list of one-syllable words for the lexical tone production 
tests. Recordings were made in a sound-proof booth using a Sony 
60EC digital recorder and an NTl microphone with a Samson 
MDR8 mixer, at a sampling rate of 44.1 kHz. There were two con- 
ditions: monosyllabic word production and trisyllabic nonsense 
word production. The monosyllabic word production test con- 
tained eight one-syllable words, with two lexical tones from each 
category (four categories: 1 = level, 2 = mid-rising, 3 = dipping, 
and 4 = high-falling). The trisyllabic nonsense word production 
test had eight trials, each formed by three one-syllable words. As 
a result, the trisyllabic word condition contained 24 one-syllable 
words in total, with six lexical tones from each tone category. 
For both lexical tone production conditions, the durations of the 
syllables (on which the lexical tones were carried) were not signifi- 
cantly different (mean ± SD for the monosyllabic word condition: 
537.5 ± 126.9 ms; for the trisyllabic nonsense word condition: 
531.7±131.6ms;p>0.1 between conditions) . As a result, a trial in 
the trisyllabic nonsense word condition lasted 2595.0 ± 225.6 ms 
(including two 500 ms gaps), about five times the mean duration 
of the monosyllabic word condition. The acoustic characteristics 
of the lexical tones for production tests are listed in Table 2. 
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Music direction production test 



up 



down 



down up 



up down 



Lexical tone production test 





ii (lal) 



# (nnu3) 



(zhao4) It (long2) 



Monosyllabic word Trisyllabic nonsense word 

FIGURE 1 I The exemplar stimuli used in the music three-note imitation test (A) and lexical tone production test (B). 



Procedure 

All of the words and musical stimuli were equalized for sound 
intensity with 10 ms linear onset/offset ramps using Praat 
(Boersma, 2001). All of the production tests were completed in a 
sound-treated booth in a single session. The stimuli were presented 
to the participants binaurally through Sennheiser HD 201 head- 
phones with individually adjustable volume. Participants were 
asked to vocally imitate the musical note sequences or the Man- 
darin words that they heard as closely as possible. They were to 
match not only pitch (for Mandarin words the accurate lexical 
tonal contours were emphasized) but also tempo of the target 



Table 2 j Acoustic characteristics of the lexical tones for production 
tests. 





Mean FO (Hz) 


FO range (Hz) 


Pitch glide 
size (Hz) 


Duration (ms) 


Tone 1 


288.5 (8.1) 


281.6-306.2 


24.6 (11.3) 


541.3 (91.3) 


Tone 2 


213.0 (23.1) 


177.9-289.2 


111.1 (20.3) 


580.0 (78.0) 


Tone 3 


156.6 (9.0) 


95.9-200.9 


105.0 (23.0) 


6475 (45.0) 


Tone 4 


234.9 (9.9) 


112.5-332.3 


219.8 (31.2) 


363.8 (79.5) 



The numbers in parentheses are standard deviations. 



Stimuli. The order of the music production test and the lexi- 
cal tone production test was counterbalanced across participants. 
Before each test, detailed instructions and a warm-up phase were 
given to ensure that all participants understood the task. In the 
music production test, the participants were encouraged to use 
syllable /la/ when imitating the note sequences. All imitation sam- 
ples were recorded onto a Marantz PMD-620 digital recorder 
(Marantz Professional, Itasca, IL, USA), with a sampling rate of 
44.1 kHz. The whole test lasted approximately 30 min for each 
participant. 



DATA ANALYSIS 
Music production test 

The fundamental frequency (FO) and duration of each produced 
note were extracted using Praat (Boersma, 2001) based on the 
identified steady-state phase of each sung note. Accordingly, three 
FO-based measures and one duration-related measure were calcu- 
lated: note deviation, interval deviation, direction accuracy, and 
duration deviation. FO measurements in hertz were converted to 
cents (100 cents = 1 semitone). 

Note deviation. Note deviation referred to the absolute difference 
between the produced FO and the target FO (e.g., Pfordresher et al., 
2010). Octave errors were corrected. 
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Interval deviation. Interval deviation was calculated as the abso- 
lute difference between the produced interval and the target 
interval for the three-note condition. 

Direction accuracy. Direction accuracy represented the rates of 
correctly produced directions in the three-note condition. A 
response was defined as correct if the successively produced 
three-note sequence shared the same direction as the target 
sequence. 

Duration deviation. Duration deviation indicated the average 
absolute differences of duration between the produced note and 
the target note (Dalla Bella et al, 2009). 

Note deviation and duration deviation were applicable for both 
one-note and three-note imitation conditions, whereas interval 
deviation and direction accuracy were entirely based on the three- 
note condition. 

Lexical tone production test 

Subjective assessment. Three independent raters (two female, 
native Mandarin speakers with a mean age of 24 years) classified 
each of the produced lexical tones from each participant as tone 
1, 2, 3, or 4. When correct, the lexical tone that was produced was 
considered a hit. For each participant, the average scores for the 
monosyllabic word and the trisyllabic nonsense word production 
conditions were calculated separately. 

Objective analysis. For each produced lexical tone, the mean FO, 
pitch glide size (i.e., the mean difference between minimum and 
maximum FO; Nan etal., 2010), and the duration were extracted 
using Praat. Octave errors were corrected for the mean FO when 
necessary. These measures were then compared to those of the tar- 
get lexical tones to calculate the mean FO deviation, pitch glide size 
deviation, and duration deviation. All these deviation measures 
were calculated as the absolute difference between the produced 
lexical tone and the target one. 

Statistical analysis 

For all ANOVAs, the assumptions of normality and homogeneity 
of variance were met. If violated, then the non-parametric alter- 
native to the planned AN OVA would be conducted instead. For all 
the repeated measures ANOVAs, however, an additional assump- 
tion of sphericity of the covariance matrix was also ensured. The 
Greenhouse-Geisser correction was applied when the sphericity 
assumption was violated. Bonferroni corrections were applied in 
multiple post hoc tests. 

RESULTS 

MUSIC PRODUCTION RESULTS 
Note deviation 

A mixed-model two-way repeated-measure ANOVA of note devi- 
ation with condition (2) as a within-subjects factor and group 
(3) as a between-subjects factor found a main effect of group 
[f (2,25) = 12.093, p < 0.001] and an interaction between condi- 
tion and group [f (2,25) = 5.607, p = 0.01]. As shown in Table 3, 
the controls significantly outperformed the amusics and tone 
agnosics (both ps < 0.0 1 ) , whereas the latter two groups performed 
similarly on note deviation (p > 0.5). Simple effect analysis of the 
observed interaction between condition and group suggested that 



Table 3 |The music production performances among the three groups. 



Control Amusia Agnosia 

Note deviatiotn (semitone) 1.9 (0.7) 2.9 (0.5) 2.8 (0.4) 

Interval deviation (semitone) 1.4 (0.7) 1. 5 (0.3) 1.6 (0.8) 

Direction accuracy (%) 98.9(3.6) 75.0(24.1) 70.3(24.0) 

Duration deviation (ms) 323.2(171.6) 253.5(96.0) 229.1(120.3) 

The numbers in parentheses are standard deviations. 

the controls performed significantly better in three-note imita- 
tion condition than in one-note imitation condition {p < 0.01), 
whereas the amusics and tone agnosics both performed similarly 
in these two conditions (bothps > 0.05). 

Interval deviation 

A one-way ANOVA of interval deviation revealed no main effect 
of group. All three groups performed indistinguishably on interval 
deviation [f(2,25) = 1. 102, p = 0.348; Table 3]. 

Direction accuracy 

Direction accuracy of the controls violated the normality 
assumption for ANOVA (one-sample Kolmogorov-Smirnov test: 
Z = 1.837, p = 0.002). The Kruskal-Wallis H test revealed 
a statistically significant difference between the three groups 
[H(2) = 10.772, p = 0.005], with a mean rank of 19.75 for the 
controls, 11.44 for the amusics, and 9.69 for the tone agnosics 
(Table 3). Pairwise comparison using Mann- Whitney U tests sug- 
gested that the controls significantly outperformed the amusics 
and tone agnosics (both ps < 0.01), whereas the latter two groups 
performed similarly on direction accuracy (p > 0.5). 

Duration deviation 

A mixed-model two-way ANOVA of duration deviation with 
condition (2) as a within-subjects factor and group (3) as a 
between-subjects factor did not reveal any significant main effects 
or interactions. The three groups performed equivalently on dura- 
tion deviation (Table 3), suggesting that, although some of the 
amusics and tone agnosics showed impairments on the frequency 
dimension of musical pitch production, their performances on the 
time dimension seemed relatively unaffected. 

LEXICAL TONE PRODUCTION RESULTS 
Subjective assessment 

Lexical tone production was highly accurate in all three groups, 
with 100% correct for both the controls and the amusics in both the 
monosyllabic and trisyllabic conditions. The tone agnosic group 
also yielded perfect lexical tone production scores for both the 
monosyllabic (99.0 ± 2.3%) and the trisyllabic (99.5 ± 1.4%) 
conditions. No main effects or interactions were observed in a 
mixed-model two-way ANOVA. 

Objective analysis 

Separate mixed-model three-way ANOVAs (2 conditions x 3 
groups X 4 tone categories) of the results of the objective acous- 
tic analysis, including the mean FO deviation, pitch glide size 
deviation, and duration deviation, did not show any significant 
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Table 4 |The results of the objective analysis for the two lexical tone 
production tests across the three groups. 



Control 


Amusia 


Agnosia 


Mean FO deviation (semitone) 



Tone 1 
Tone 2 
Tone 3 
Tone 4 



1.9 (1.7) 

2.5 (1.4) 
3.4 (0.9) 

2.6 (1.6) 



2.9 (1.7) 

3.3 (1.1) 

3.7 (1.4) 

2.7 (1.4) 



Pitch glide size deviation (semitone) 



Tone 1 
Tone 2 
Tone 3 
Tone 4 



0.7 (0.3) 
2.4 (0.9) 
5.7 (1.4) 
79 (1.7) 



Duration deviation (ms) 

Tone 1 92.8 (45.6) 

Tone 2 106.2(42.4) 
Tone 3 131.7(66.4) 
Tone 4 1078(68.6) 



0.6 (0.3) 
2.4 (1.3) 
6.7 (1.6) 
9.3 (2.1) 

78.8 (42.1) 
96.2 (30.5) 
113.2 (84.6) 
106.6 (577) 



1.9 (1.4) 
2.2 (1.1) 

4.0 (0.8) 

2.1 (1.5) 

0.9 (0.5) 
2.4 (0.8) 
5.4 (2.0) 
75 (2.3) 

94.0 (32.7) 
575 (39.5) 
91.6 (50.0) 

83.1 (35.5) 



The numbers in parentheses are standard deviations. Tone 1 is the ievel tone, 
tone 2 mid-rising, tone 3 dipping, and tone 4 high-falling. 

main effects or interactions involving group (see Table 4). This 
is consistent with the results obtained in the subjective assess- 
ments, suggesting that neither the tone agnosics nor the amu- 
sics were impaired in lexical tone production relative to the 
controls. 
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0 12 3 4 

Note Deviation for IVIusic Production (st) 

FIGURE 2 I A significant negative correlation between the melodic 
MBEA score and note deviation for music production tests among the 
three groups. A daslied line divides two distinct groups based on 
individual performances on note deviation for the controls, the amusic, and 
tone agnosic groups. 



correlation between the equivalent music production and per- 
ception measures (i.e., direction accuracy and MBEA contour 
score or interval deviation and MBEA interval score) either (both 
ps > 0.1). 

These results suggest that musical pitch perception is tightly 
linked to lexical tone perception, and musical pitch production 
and perception are significantly correlated (Figure 2). 



CORRELATION ANALYSIS 

Spearman's Rank Correlation analyses were conducted to exam- 
ine the relationship between musical pitch perception, lexical tone 
perception, musical pitch production, and objective lexical tone 
production measures across and within the three groups of par- 
ticipants. The subjective lexical tone production scores were not 
taken into account due to the ceiling effect. Pitch perception mea- 
sures included the melodic MBEA scores (averaged across the scale, 
contour, and interval tests) and the average lexical tone perception 
scores. Pitch production measures included note deviation, inter- 
val deviation, direction accuracy, and duration deviation for music 
as well as the mean FO deviation, pitch glide size deviation, and 
duration deviation for lexical tones. 

The results showed that the melodic MBEA scores were 
positively correlated with the average lexical tone perception 
scores across the three groups [rs(28) = 0.575, p = 0.001]. 
More importantly, the melodic MBEA scores were also sig- 
nificantly and negatively correlated with note deviation across 
the three groups (Figure 2), rs(28) = -0.695, p < 0.001. 
However, neither of these two correlations held within each 
individual group (all ps > 0.1). There was no significant 
correlation between pitch perception measures (both melodic 
MBEA scores and the average lexical tone tests scores) and 
other music pitch production measures (including direction accu- 
racy, interval deviation, and duration deviation) or lexical tone 
production measures (all ps > 0.1). There was no significant 



INDIVIDUAL-LEVEL ANALYSIS 

We conducted individual-level analyses based on note deviation 
and direction accuracy; we selected these two measures on account 
of their clear group differences. 

For all 28 participants from the three groups, K-means cluster 
analysis using note deviation yielded two distinct groups (good 
pitch imitation vs. poor pitch imitation). As shown in Figure 2, 
a subgroup of the controls (n = 7) imitated musical pitch with 
significantly smaller note deviations (around 1.4 semitones) than 
the rest of the controls and all the amusics and tone agnosics 
(around 2.8 semitones). 

On the other hand, K-means cluster analysis using direc- 
tion accuracy yielded two different distinct groups. As shown in 
Figure 3, all controls {n= 12), five amusics, and four tone agnosics 
demonstrated relatively better music direction production perfor- 
mance (i.e., the "good direction imitation" group, with direction 
accuracy around 95%), whereas three amusics and four tone 
agnosics fell into the other group (the "poor direction imitation" 
group, with direction accuracy around 50%). 

These results suggest that note deviation and direction accuracy 
are two different measures describing musical pitch production. 
Measured by note deviation, five controls and all the amusics and 
tone agnosics demonstrated poor musical production, whereas 
measured by direction accuracy, none of the controls and only 
subgroups (about half) of the amusics and tone agnosics showed 
poor musical production. 
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FIGURE 3 I Direction accuracy of three-note imitation across tlie thiree 
groups. A daslied line divides two distinct groups based on individual 
performances on direction accuracy of three-note imitation for the controls, 
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DISCUSSION 

Auditory perception and vocal production are closely related, 
although the exact nature of this link is still hotly debated (e.g., 
Lotto et al, 2009; Dalla Bella et al., 20 11 ) . The present study tried to 
determine to what extent perceptual pitch deficits across domains 
are related to pitch production difficulties in music and Mandarin 
speech. Our results showed that the amusics and tone agnosics 
both had musical pitch production difficulties. However, the per- 
ceptual pitch deficits across domains did not affect lexical tone 
production. 

The current study found that the controls outperformed both 
the amusic and tone agnosic individuals on note deviation and 
music direction accuracy. Moreover, across the three groups, note 
deviation performance was significantly and negatively correlated 
with the melodic MBEA scores, suggesting an association between 
musical pitch perception and production. These findings cor- 
roborates previous results found among speakers of non-tonal 
languages (Dalla Bella et al., 2009) and further supports the notion 
that pitch perception and pitch production are closely coupled in 
the music domain (Berkowska and Dalla Bella, 2009; Dalla Bella 
etal., 2011) by using data from speakers of a tonal language. Fur- 
thermore, there was also a close link between lexical tone and 
musical pitch perception, corroborating the notion that lexical 
tone deficits among tone agnosics are associated with musical 
pitch disorders (Nan etal., 2010). However, it should be noted 
that according to the current results, tone agnosics were not more 
impaired than the amusics in musical pitch production, suggesting 
that lexical tone perception deficits are not necessarily detrimental 
to musical pitch production. 

More interestingly, all participants with lexical tone percep- 
tion impairments demonstrated intact lexical tone production, 
corroborating our previous results (Nan etal, 2010). This dis- 
sociation between perception and production of linguistic tones 
among amusics is also in line with previous results obtained using 



speakers of non-tonal languages (Hutchins and Peretz, 2012). A 
recent event-related potential paradigm (ERP) study reports a 
similar dissociation between production and perception of lex- 
ical tones for Cantonese (Law etal, 2013), suggesting that the 
observed independence between perception and production for 
lexical tones is relatively robust. This is partly in line with results 
from a recent study, which showed partial support for domain 
specific pitch processing, but at the same time a close association 
between song and speech imitation performance (Mantell and 
Pfordresher, 2013). 

Nonetheless, neither the VSL model (Berkowska and Dalla 
Bella, 2009) nor the vocal-motor encoding theory (Hutchins 
and Peretz, 2012) can easily incorporate the distinct perception- 
production relationships for music and Mandarin speech that were 
observed in the current study. It is possible that pitch production 
in music and Mandarin speech involve independent but interac- 
tive systems, similar to the recently proposed dual routes for verbal 
repetition (Yoo etal., 2012). The production of lexical tones may 
mainly engage the acoustic-phonetic systems primarily relying 
on the high temporal resolution of the left hemisphere, whereas 
the production of musical pitches as well as cross-domain pitch 
perception (including both musical pitch and lexical tone percep- 
tion) may mainly require the high frequency resolution of the right 
hemisphere (Zatorre and Belin, 2001; Luo etal., 2006), although 
the left inferior frontal gyrus is often implicated in lexical tone 
(Hsieh etal., 2001) or lexical tone and music pitch perception 
(Nan and Friederici, 2013) in Mandarin speakers as well. Thus, 
this classical view of hemispheric asymmetries in spectral and 
temporal processing (Zatorre and Belin, 2001) may account for 
how Mandarin lexical tone production can be preserved simulta- 
neously with impaired production in musical pitch and impaired 
pitch perception across domains. 

Moreover, the current results provide more insights on the 
nature of congenital amusia from the perspective of music pro- 
duction. Both the amusics and tone agnosics performed similarly 
to the controls on three-note imitation when measured by inter- 
val deviation, but these two groups were significantly impaired 
relative to the controls as measured by direction accuracy. This 
corroborates the accumulating results on music perception: com- 
pared to the controls, the amusics are not necessarily impaired 
in pitch discrimination thresholds (Foxton etal., 2004; Tillmann 
et al, 2009; Albouy et al., 20 13), but they show relatively consistent 
difficulties in discriminating pitch direction (Foxton etal, 2004; 
Liu et al, 2010). Hence, the current study adds more evidence on 
the notion that the related deficits of congenital amusia may as well 
arise at a relatively higher stage of pitch processing, e.g., perceiving 
and producing pitch directions (for a review, see Stewart, 2011). 
Furthermore, the observed clear split of good and poor musi- 
cal pitch production groups among amusics and tone agnosics 
as measured by music direction accuracy is in line with previous 
results on the possible existence of subgroups within the amusic 
population (Dalla Bella etal., 2009; Nan etal., 2010), converging 
on the notion that congenital amusia is indeed a complex dis- 
order and often involves variously mixed presentations (Stewart, 
2011). 

It should be noted that, in the current study, all of the partici- 
pants' performances in musical pitch production (except direction 
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accuracy) were relatively lower when compared to previous sim- 
ilar studies (e.g., Amir etal, 2003; Watts etal, 2005; Dalla Bella 
et al, 2007; Pfordresher et al., 2007; Wise and Sloboda, 2008). This 
may be due to the effects of tasks and stimuli for the music pro- 
duction tests. First, despite using similar stimuli, our imitation 
task was more demanding compared to the tasks used in pre- 
vious studies (Amir etal., 2003; Watts etal, 2005; Pfordresher 
etal., 2007). These previous studies usually presented each stim- 
ulus several times during the test, whereas in the present study, 
all stimuli were presented only once. Second and more impor- 
tantly, for music production tasks, the present study used piano 
tones in a pitch range that was suitable for females but not for 
male participants. As we have more male participants {n = 17) 
than female participants (« = 11), this gender effect would have 
contributed to the overall lower musical production performance 
observed in the current study. However, it should be noted that, 
when gender was considered as an additional between-subjects 
factor, no significant main effect or interactions involving gender 
were found, probably due to the small sample size. Likewise, in 
lexical tone production task, female voices were used. This might 
also have caused male participants more difficulties in lexical tone 
imitation than females, although no gender effect was statistically 
significant. 

Additionally, it is important to point out the fact that the lower 
production performances we observed occurred not only for the 
pitch dimension but also for the time dimension, in contrast to the 
findings of previous research (Dalla Bella et al, 2009). Based upon 
the same criterion, the current study found an average duration 
accuracy of approximately 30% across the three groups, whereas 
a previous study (Dalla Bella etal, 2009) with a familiar melody 
reported average duration accuracy more than 90% among the 
controls and the amusics. Nonetheless, the current data demon- 
strated that the amusics as well as tone agnosics were not impaired 
relative to the controls in the time dimension for music produc- 
tion, corroborating the previous study (Dalla Bella etal., 2009). 
Together with data from musical pitch perception (Hyde and 
Peretz, 2004; Nan et al, 2010), the current results farther support 
the notion that pitch deficits in both perception and production 
related to congenital amusia mainly affect the frequency dimen- 
sion but not the time dimension (Hyde and Peretz, 2004; Dalla 
Bella et al., 2009; Nan et al, 2010). 

It should also be noted that a more recent study reported 
impaired speech and song imitation among Mandarin-speaking 
amusics (Liu etal, 2013), whereas our results showed that the 
amusics were only impaired in musical pitch production but not 
lexical tone production. This discrepancy of speech tone produc- 
tion in Mandarin-speaking amusics between our results and those 
of Liu et al. (2013) might be caused by different stimuli employed 
in these studies. The current study used speech stimuli which con- 
tained equal numbers of four lexical tones in Mandarin, whereas 
Liu etal. (2013) did not control for the number of lexical tones 
from each tone category. As shown in Table 2 (Liu etal., 2013), 
among the set of selected speech stimuli used in the experiment, 
there were 28 level tones (tone 1) and five dipping tones (tone 3). 
With such a high rate of level tones (28 among 60 syllables, almost 
half), the speech stimuli did not well represent Mandarin which 
has four main lexical tones. 



The observed intact lexical tone production among amusics 
and tone agnosics in our present study, however, might also be 
due to the fact that the current speech stimuli were mainly drawn 
from everyday materials. It is inevitable that the speech stimuli 
employed in the current study were more familiar to the partici- 
pants than the music stimuli. Over years of experience with daily 
production needs, amusics and especially tone agnosics might 
have learned how to produce speech tones despite their percep- 
tual impairments. But clearly the extent of their exposure and 
daily production pressure for music pitches is much lower. The 
resulted disparity of the learning processes between music and 
speech domains might thus also account for the intact lexical tone 
production but impaired music pitch production among amusics 
and tone agnosics relative to controls. Additionally, the timbre 
difference between the speech (human voices) and music stimuli 
(piano tones) might have played a role. As suggested by previous 
research (e.g., Leveque etal., 2012), piano timbre is more difficult 
to imitate than human voice. 

Nonetheless, it should be noted that although the FO devia- 
tions of the produced lexical tones in all the three groups were 
not negligible, the intelligibility of the speech words were not 
affected. This indicates different functional standards for pitch 
production in lexical tones than that in music. More interestingly, 
based on the current results, we might tentatively speculate that 
the pitch-related production skills necessary for intelligibility of 
speech (such as those measured by direction accuracy and interval 
deviation) are largely intact in the amusics and tone agnosics as 
tested in the current study, while the aspect that is more specific to 
music (for instance the one represented by note deviation) is the 
one that is most compromised. 

CONCLUSION 

Individuals with perceptual pitch deficits known as congenital 
amusics and tone agnosics represent unique opportunities to 
understand the intriguing relationships between pitch produc- 
tion and perception in music and language. The current study 
examined to what extent the perceptual pitch deficits involved in 
both the amusics and tone agnosics are related to pitch produc- 
tion difficulties in music and Mandarin speech among Mandarin 
speakers. For music, our results demonstrated that pitch pro- 
duction difficulties may be present in both the amusic and tone 
agnostic groups, resulting in significantly enlarged note deviation 
and decreased direction accuracy in these two groups relative to the 
controls. Moreover, tone agnosics were not more impaired than 
the amusics in musical pitch production, suggesting that lexical 
tone perception deficits are not necessarily detrimental to musi- 
cal pitch production. For language, on the other hand, all three 
groups were able to imitate lexical tones with perfect intelligibility, 
suggesting that the perceptual pitch deficits across domains may 
coexist with intact lexical tone production. Taken together, the 
current results imply that the perception-production relationship 
for pitch among individuals with perceptual pitch deficits may be 
domain-dependent. 
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